Can language models preserve their own alignment?
I wouldn’t consider this an argument for that claim; rather, it is a project proposal to empirically test how much models remain “good.”